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DETAILED ACTION 



1 . Claims 1 -40 and 42-43 Pending. 



Response to Arguments 

2. Applicant's arguments, see response filed 8/14/2006, with respect to the 
rejection(s) of claim(s) 1-40 and 42-43 under USC 103(a) have been fully considered 
and are persuasive. Therefore, the rejection has been withdrawn. However, upon 
further consideration, a new ground(s) of rejection is made in view of Hansen in view of 
Chang et al. 

Note that all of the deficiencies found in the combination of Hansen and Bite! will 
be addressed in combination of Hansen and Chang et al., below. 

Claim Rejections - 35 USC § 103 

1 . The following is a quotation of 35 U.S.C, 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

• (a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

1 . Claims 1-40 and 42-43 rejected under 35 U.S.C. 103(a) as being anticipated by 
Hansen et al. ("Using navigation data to improve IR functions in the context of Web search", 
Proceedings of the tenth international conference on Information and knowledge management; Atlanta. 
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Georgia, USA; Pages 135-142; ACM Press. 2001 and referred to hereinafter as Hansen) in view of 
Chang et al. ("A Multi-Engine Search Tool with Clustering", 6^^ International World Wide Web 
Conference. April 1997 and referred to hereinafter as Chang). 

As per Claim 1 , Hansen discloses a system that refines a general-purpose 
search engine, comprising (i.e. "Traditional search engines like Lycos and Google now routinely 

return tens of thousands of resources per query,.. we propose narrowing search results by obsen/ing the 
browsing patterns of users during search tasks." The preceding text excerpt clearly indicates a system 
that narrows the search results of/refines a traditional search engines/a general purpose search engine 
(e.g. Lycos, Google).) (Page 135, Column 1, Paragraph 2). 

Hansen fails to disclose a component that identifies an entry point that includes a 
link utilized to access the general-purpose search engine and a tuning component that 
filters search query results of the general-purpose search engine based on criteria 
associated with the entry point. 

Chang discloses a component that identifies an entry point that includes a link 
utilized to access the general-purpose search (i.e. "The implementation of the system is based 
on the multi-engine search architecture. Under this architecture, the system receives a user's query as a 
list of keywords and redirects the query to other search engines (which include Altavista, Infoseek, Lycos, 
WebCrawler, Magellan, etc.). " The preceding text excerpt clearly indicates that a link used to forward 
users queries to a remote commercial search engine (e.g. an entry point to a search engine) exists.) 
(Page 2. Paragraph 4) and a tuning component that filters search query results of the 

general-purpose search engine based on criteria associated with the entry point (i.e. "The 
feedback of the selected topics also helps to customize the knowledge base, since there is a percentage 
of negligible words, like people names or mistyped words that are not common to other queries and are 
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weighed high in the weighting. But with user's feedback, we can identify what are the terms to be 
stressed and what are the ones that are irrelevant to the query.,, feedback of the same keywords can be 
repeated to strengthen their significance." The preceding text excerpt clearly indicates that the query 
results are filtered based on information from the knowledge base, which is associated with the entry 
point, and are used to further refine the knowledge base.) (Page 6, Paragraph 2). 

It would have been obvious to one skilled in the art at the time of Applicants 

invention to modify the teachings of Hansen with the teachings of Chang to include a 

component that identifies an entry point entry point that includes a link utilized to access 

the general-purpose search and a tuning component that filters search query results of 

the general-purpose search engine based on criteria associated with the entry point with 

the motivation of forwarding search queries to search engines and refining the query 

results to direct the searching towards the intended purpose (Page 2, Paragraphs 4-5). 

-As per Claim 2, Hansen discloses the criteria comprising one or more of a 
document property, a context parameter, and a configuration (i.e. "...these schemes would 
involve the queries submitted by users together with the top L relevant pages returned by a given search 
engine." The preceding text excerpt clearly indicates that the criteria include queries submitted by 
users/keywords (e.g. a context parameter) and a URL (e.g. document property).) (Page 137, Column 1. 
Paragraph 4). 

As per Claim 3, Hansen discloses the document property comprising one or 
more of a term that appears on a web page, a property of a Uniform Resource Locator 
(URL) identifying the web page, a property of a plurality of URLs that link to the web 
page, a property of a plurality of web pages that link to the web page, and a layout (i.e. 
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". ..these schemes would involve the queries submitted by users together with the top L relevant pages 
returned by a given search engine. " The preceding text excerpt clearly indicates that the document 
property includes queries submitted by users/a term that appears on a web page and the name/property 
of a URL) (Page 137, Column 1, Paragraph 4). 

As per Claim 4, Hansen discloses tlie context parameter comprising one of a 
word probability and a probability distribution (i.e. "The group relation is captures by the triple (qu 
k, WiiJ, where k denotes a group ID and Wj^ is the probability that q-, belongs to group k. Then, for each 
group, we identify a number of relevant URLs. This is described by the triple (k, Up A„j) where uj is a URL 
and Akj is a weight that determines how likely it is that Uj is associated with the queries belonging to group 
/c/'The preceding text excerpt clearly indicates that a relevancy weight/probability distribution is used.) 
(Page 137. Column 2. Paragraph 4). 

As per Claim 5, Hansen discloses the tuning component provided with training 
data to learn what properties of a document are indicative of the document being 
relevant to a user executing a search query from the entry point (i.e. "To help guide the 
cluster process, we also introduce labeled data from an existing topic hierarchy that contains over 1.2 
million Web sites. . . We let the index j range from 1 to the number of URLs seen in the training data, J 
(which might include URLs from a content hierarchy). " The preceding text excerpt clearly indicates that 
training data is provided to help guide the cluster process (e.g. indicate what properties of a document are 
indicative of that document being relative to a user).) (Page 137, Column 2, Paragraph 4; Page 138, 
Column 1, Paragraph 1). 

As per Claim 6, Hansen discloses the tuning component configured to 
differentiate between a query result that is relevant to a search query context for a 
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group of users and a query result that is non-relevant to the search query context for the 
group of users (i.e. "Then, for each group, we identify a number of relevant URLs. This is described by 

the triple (k, Up Akj) where uj is a URL and Akj is a weight that determines how likely it is that Uj is 
associated with the queries belonging to group /c."The preceding text excerpt clearly indicates that each 
URL/query result is given a relevance weight associated with the group used for the search. If the URL is 
relevant, it is assigned a high relevancy score, and if the URL is non-relevant, it is assigned a low 
relevancy score.) (Page 137. Column 2, Paragraph 4). 

As per Claim 7, Hansen discloses the tuning component employs statistical 
analysis in connection with filtering the search query results (i.e. "As mentioned above, sets 
of such triples constitute the parameters in a statistical model for the search sessions. These triples can 
be used by a search engine to improve page rankings. " The preceding text excerpt clearly indicates that 
statistical modeling/analysis is used in connection with page ranking/filtering of the search results.) (Page 
138, Column 1, Paragraph 2). 

As per Claim 8, Hansen discloses the tuning component generates one or more 
context parameters for a received query result (i.e. 'The group relation is captures by the triple 
(Qh K v^ik)s v^here k denotes a group ID and Wjk is the probability that cjf/ belongs to group k. Then, for each 
group, we identify a number of relevant URLs. This is described by the triple (k, Uj, A^) where uj is a URL 
and At(f is a weight that determines how likely it is that Uj is associated with the queries belonging to group 
/(."The preceding text excerpt clearly indicates that a relevancy weight/context parameter is generated for 
a received query result.) (Page 137, Column 2. Paragraph 4), and then compare the generated 
context parameters with a relevant context parameter and a non-relevant context 
parameter to determine whether the query result is relevant (i.e. "The group relation is 
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captures by the triple (qi, k, WnJ, where k denotes a group ID and is the probability that qi belongs to 
group k, Then, for each group, we identify a number of relevant URLs. This is described by the triple (K 
Up Akj) where uj is a URL and A^j is a weight that determines how likely it is that Uj is associated with the 
queries belonging to group k."The preceding text excerpt clearly indicates that in order to determine if the 
query result is associated with a group (e.g. group k) the relevancy weight/context parameter is 
examined. Note that a high relevancy weight constitutes a relevant context parameter and a low 
relevancy weight constitutes a non-relevant context parameter. Further note that in order to determine if 
the query result is related to the group, the relevancy weight/context parameter for the query result must 
be compared to low and high relevancy weights.) (Page 137, Column 2, Paragraph 4). 

As per Claim 9, Hansen discloses the tuning component further ranks the query 
results (i.e. "Our clustering can also be used to modify the rankings of results from a traditional search 
engine." Ihe preceding text excerpt clearly indicates that the clustering/filtering is used to improve 
ranking/rank query results.) (Page 138, Column 1, Paragraph 3). 

As per Claim 10, Hansen discloses the ranking determined by the degree of 
relevance of the query result to a relevant data set and a non-relevant data set, the 
relevance is determined via one of a similarity measure and a confidence interval (i.e. 
"Then, for each group, we identify a number of relevant URLs. This is described by the triple (k, Uj, A^j) 
where uj is a URL and Akj is a weight that determines how likely it is that Uj is associated with the queries 
belonging to group k...As mentioned above, sets of such triples constitute the parameters in a statistical 
model for the search sessions. These triples can be used by a search engine to improve page rankings. " 
The preceding text excerpt clearly indicates that that ranking is determined by a relevancy score/similarity 
measure. assigned to the URL by comparing it to a group of queries (e.g. group k). Note that not all of the 
pages in the group that the URL is being compared to are relevant as some may have a low relevancy 
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score iri the set, therefore the group (e.g. group k) constitutes both a set of relevant data, and a set of 
non-relevant data.) (Page 137, Column 2, Paragraph 4; Page 138, Column 1, Paragraph 2). 

As per Claims 1 1 and 20, Hansen discloses the ranking order comprising one of 
ascending and descending, from the most relevant result to the least relevant result (i.e. 

"Here, we arrange the query groups and the URLs by weight, with the most relevant appearing at the 
fop. "The preceding text excerpt clearly indicates that the results are sorted in descending order, with the 
most relevant results at the top.) (Page 138. Column 1, Paragraph 2). 

As per Claim 12, Hansen discloses the tuning component configured for a 
plurality of entry points associated with one or more groups of users (i.e. "...we present 
three search sessions (each initiated by a different user)... "The preceding text excerpt clearly indicates 
that entry points for multiple groups of users may be defined.) (Page 137, Column 1, paragraph 3). 

As per Claim 13, Hansen discloses a system that tunes a general-purpose 
search engine, comprising (i.e. "Traditional search engines like Lycos and Google now routinely 

return tens of thousands of resources per query... we propose narrowing search results by observing the 
browsing patterns of users during search tasks." The preceding text excerpt clearly indicates a system 
that narrows the search results of/tunes a traditional search engines/a general purpose search engine 
(e.g. Lycos, Google).) (Page 135, Column 1, Paragraph 2): a filter component that parses 

relevant and non-relevant general-purpose search engine content results based on 
training data (i.e. "To help guide the cluster process, we also introduce labeled data from an existing 
topic hierarchy that contains over 1 . 2 million Web sites ...We let the index j range from 1 to the number of 
URLs seen in the training data, J (which might include URLs from a content hierarchy)... When a new user 
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initiates a search, we present them with a display of query groups most related to their search terms. For 
each such group, we select the most relevant URLs arranged in a display like that in Figure 4. " The 
preceding text excerpt clearly indicates training data is used to parse general search engine results for 
related query groups/an entry point,) (Figure 4; Page 137, Column 2, Paragraph 4; Page 138, Column 1, 
Paragraphs 1-2); and a ranking component that sorts tlie filtered results in accordance with 

the training data for presentation to a user (i.e. "To help guide the cluster process, we also 

introduce labeled data from an existing topic hierarchy that contains over 1.2 million Web sites... We let 
the index j range from 1 to the numtier of URLs seen in the training data, J (which might include URLs 
from a content hierarchy) . . . These triples can be used by a search engine to improve page rankings. " The 
preceding text excerpt clearly indicates that a ranking component is present which ranks data based on/in 
accordance with the content hierarchy/training data. Note that Figure 4 illustrated the results being 
displayed to a user.) (Figure 4; Page 137, Column 2, Paragraph 4; Page 138. Column 1, Paragraphs 1- 

2). • 

Hansen fails to disclose that the content results are for an entry point that 
provides a link employed to traverse the general purpose search engine. 

Chang discloses that the content results are for an entry point that provides a link 
employed to traverse the general purpose search engine (i.e. "The implementation of the 

system is based on the multi-engine search architecture. Under this architecture, the system receives a 
user's query as a list of keywords and redirects the query to other search engines (which include 
AltaVista, Infoseek, Lycos, WebCrawler, Magellan, etc.). " The preceding text excerpt clearly indicates 
that a link used to forward users queries to a remote commercial search engine (e.g. an entry point to a 
search engine) exists.) (Page 2, Paragraph 4). 

It would have been obvious to one skilled in the art at the time of Applicants 

invention to modify the teachings of Hansen with the teachings of Chang to include that 

the content results are for an entry point that provides a link employed to traverse the 
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general purpose search engine witii the motivation of forwarding search queries to 
search engines and refining the query results to direct the searching towards the 
intended purpose (Page 2, Paragraphs 4-5). 

As per Claim 14, Hansen discloses the filter component parsing the results as a 
function of one or more of a document property, a context parameter, and a 
configuration associated with the entry point (i.e. ".. we consider improving search results by first 

forming groups of queries based on the similarity of their associated search sessions.. . This has the effect 
of reducing spurious associations between queries... these schemes would involve the queries submitted 
by users together with the top L relevant pages returned by a given search engine. " The preceding text 
excerpt clearly indicates that a filter component exists which parses the results using queries submitted 
by users/keywords (e.g. a context parameter) and a URL (e.g. document property).) (Page 137, Column 
1 , Paragraphs 3-4) 

As per Claim 15, Hansen discloses the filter component trained to differentiate 
between a relevant and a non-relevant result via the training data (i.e. "To help guide the 

cluster process, we also introduce labeled data from an existing topic hierarchy that contains over 1.2 
million Web sites.. . We let the index j range from 1 to the number of URLs seen in the training data, J 
(which might include URLs from a content hierarchy) . . . Then, for each group, we identify a number of 
relevant URLs. This is described by the triple (k, uj, Akj) where uj is a URL and Akj is a weight that 
determines how likely it is that Uj is associated with the queries belonging to group k, " The preceding text 
excerpt clearly indicates that, using training data, each URL/query result is given a relevance weight 
associated with the group used for the search. If the URL is relevant, it is assigned a high relevancy 
score, and if the URL is non-relevant, it is assigned a low relevancy score.) (Page 137, Column 2, 
Paragraph 4). 
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As per Claim 16, Hansen discloses the training data comprising a set of relevant 
data associated with a search context of a user for the entry point and a set of non- 
relevant data comprising random data unrelated to the search context of the user for the 
entry point (i.e. To help guide the cluster process, we also introduce labeled data from an existing topic 
hierarchy that contains over 1.2 million Web sites.. We let the index j range from 1 to the number of URLs 
seen in the training data, J (which might include URLs from a content hierarchy)." The preceding text 
excerpt clearly indicates that a set of training data, the portion of which that is associated with the search 
context of the user for the entry point constituting the set of relevant data, and the portion of which that is 
irrelevant to the search context of the user for the entry point constituting the set of non-relevant data, 
exists and are used to help determine the relevancy of a search result. Note that the set of unrelated 
data comes from an existing data hierarchy, and can therefor be considered random.) (Page 137, Column 
2, Paragraph 4; Page 138, Column 1, Paragraph 1). 

As per Claim 17, Hansen discloses the filter component employs statistical 
analysis to determine whether a result is relevant or non-relevant to the entry point (i.e. 

"As mentioned above, sets of such triples constitute the parameters in a statistical model for the search 
sessions. These triples can be used by a search engine to improve page rankings. " The preceding text 
excerpt clearly indicates that statistical modeling/analysis is used in connection with page ranking/filtering 
of the search results which determine relevancy. Note from above that the triple referred to contain a 
relevancy score.) (Page 138, Column 1, Paragraph 2). 

As per Claim 18, Hansen discloses the ranking component employs a technique 
to determine the degree of relevance of the query results with respect to a relevant data 
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set and a non-relevant data set (i.e. "To help guide the cluster process, we also introduce labeled 
data from an existing topic hierarchy that contains over 1.2 million Web sites... We let the index j range 
from 1 to the number of URLs seen in the training data, J (which might include URLs from a content 
hierarchy)." The preceding text excerpt clearly indicates that a set of, the portion of which that is 
associated with the search context of the user for the entry point constituting the set of relevant data, and 
the portion of which that is irrelevant to the search context of the user for the entry point constituting the 
set of non-relevant data, exists and are used to help determine the relevancy of a query result.) (Page 
137, Column 2, Paragraph 4; Page 138, Column 1, Paragraph 1). 



As per Claim 19, Hansen discloses the technique comprising one of a similarity 
measure and a confidence interval (i.e. "Then, for each group, we identify a number of relevant 

URLs. This is described by the triple (k, Uj, Akj} where uj is a URL and Akj is a weight that determines how 
likely it is that uj is associated with the queries belonging to group k...As mentioned above, sets of such 
triples constitute the parameters in a statistical model for the search sessions. These triples can be used 
by a search engine to improve page rankings." The preceding text excerpt clearly indicates that that 
ranking/relevancy is determined by a relevancy score/similarity measure assigned to the URL by 
comparing it to a group of queries (e.g. group k).)(Page 137, Column 2, Paragraph 4; Page 138, Column 
1, Paragraph 2). 

As per Claim 21, Hansen discloses the ranking performed on the relevant query 
results, wherein the non-relevant results are discarded (i.e. "For each group, we select the 

most relevant URLs arranged in a display like that in Figure 4. Here, we arrange the query groups and the 
URLs by weight, with the most relevant appearing at the top. " The preceding text excerpt clearly indicates 
that only the most relevant URLs/results are displayed, while the less or non-relevant results are 
discarded.) (Page 138, Column 1, Paragraph 2). 
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As per Claim 22, Hansen discloses a method to filter and rank general-purpose 
search engine results associated with an entry point, comprising (i.e. "These triples can be 

used by a search engine to improve page rankings. When a new user initiates a search, we present them 
with a display of query groups most related to their search terms. For each such group, we select the 
most relevant URLs arranged in a display like that in Figure 4. "The preceding text excerpt clearly 
indicates a method for filtering and ranking of general-purpose search engine results associated with an 
entry point. Note that the entry point is associated with the groups mentioned in the quotation and as 
referenced above.) (Page 138. Column 1, Paragraph 2); and ranking the general-purpose search 

engine results (i.e. "These triples can be used by a search engine to improve page rankings. When a 

new user initiates a search, we present them with a display of query groups most related to their search 
terms. For each such group, we select the most relevant URLs arranged in a display like that in Figure 
4." The preceding text excerpt clearly indicates that the general-purpose search engine results are 
ranked.) (Page 138, Column 1, Paragraph 2). 

Hansen fails to disclose executing a query search with the general purpose 
search engine accessed through a link associated with the entry point and filtering the 
general-purpose search engine results by tuning the general purpose search engine 
based on the entry point. 

Chang discloses executing a query search with the general purpose search 
engine accessed through a link associated with the entry point (i.e. "The implementation of 

the system is based on the multi-engine search architecture. Under this architecture, the system receives 
a user's query as a list of keywords and redirects the query to other search engines (which include 
AltaVista, Infoseek, Lycos, WebCrawler, Magellan, etc.). " The preceding text excerpt clearly indicates 
that a link used to fonA/ard users queries to a remote commercial search engine (e.g. an entry point to a 
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search engine) exists.) (Page 2, Paragraph 4) and filtering the general-purpose search engine 
results by tuning the general purpose search engine based on the entry point (i.e. "The 

feedback of the selected topics also helps to customize the knowledge base, since there is a percentage 
of negligible words, like people names or mistyped words that are not common to other queries and are 
weighed high in the weighting. But with user's feedback, we can identify what are the terms to be 
stressed and what are the ones that are irrelevant to the query... feedback of the same keywords can be 
repeated to strengthen their significance," The preceding text excerpt clearly indicates that the query 
results are filtered based on information from the knowledge base, which is associated with the entry 
point, and are used to further refine the knowledge base.) (Page 6, Paragraph 2). 

It would have been obvious to one skilled in the art at the time of Applicants 

invention to modify the teachings of Hansen with the teachings of Chang to include 

executing a query search with the general purpose search engine through the entry 

point and filtering the general-purpose search engine results by tuning the general 

purpose search engine based on the entry point with the motivation of fonvarding 

search queries to search engines and refining the query results to direct the searching 

towards the intended purpose (Page 2, Paragraphs 4-5). 

As per Claim 23, Hansen discloses employing a statistical hypothesis to 
determine whether a result is relevant or non-relevant to a search context of the entry 
point (i.e. "As mentioned above, sets of such triples constitute the parameters in a statistical model for 
the search sessions. These triples can be used by a search engine to improve page rankings. " The 
preceding text excerpt clearly indicates that statistical modeling/a statistical hypothesis is used in 
connection with page ranking of the search results, which determines relevancy. Note from above that 
the triple referred to contain a relevancy score.) (Page 138, Column 1, Paragraph 2). 
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As per Claim 24, Hansen discloses the statistical hypothesis employing a 
threshold in connection with a probability distribution for relevant data and a probability 
distribution for non-relevant data (i.e. "Then, for each group, we ideritify a number of relevant 

URLs. This is described by the triple (k, Uj, Anj) where uj is a URL and A^j is a weight that determines how 
likely it is that uj is associated with the queries belonging to group k...As mentioned above, sets of such 
triples constitute the parameters in a statistical model for the search session. . .For each such group, we 
select the most relevant URLs..." The preceding text excerpt clearly indicates that a threshold to 
determine which are the most relevant URLs exists and that a relevancy weight/probability distribution of 
a document is calculated for each group, some of which are relevant to the search, and some of which . 
are not.) (Page 137, Column 2, Paragraph 4; Page 138. Column 1, Paragraphs 1-2), wherein 

respective word probabilities are generated for the search query results and compared 

to the threshold (i.e. "Then, for each group, we identify a number of relevant URLs. This is described 

by the triple (k, Uj, Akj) where uj is a URL and A^j is a weight that determines how likely it is that Uj is 
associated with the queries belonging to group k... For each such group, we select the most relevant 
URLs..." The preceding text excerpt clearly indicates that in order for a URL/result to be considered most 
relevant it's relevancy weight/probability distribution must be first calculated, then compared to the 
threshold.) (Page 137, Column 2, Paragraph 4; Page 138, Column 1, Paragraphs 1-2), the probability 

distribution for relevant data and the probability distribution for non-relevant data to 

determine whether the results are relevant or non-relevant (i.e. "Then, for each group, we 

identify a number of relevant URLs. This is described by the triple (k, Uj, Akj) where uj is a URL and Aig is a 
weight that determines how likely it is that Uj is associated with the queries belonging to group k. " The 
preceding text excerpt clearly indicates that the relevancy score/probability distribution is used to 
determine which results are relevant and which results are non-relevant.) (Page 137, Column 2, 
Paragraph 4). 
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As per Claim 25, Hansen discloses the threshold employed to bias the decision 
to mitigate one of a result being deemed non-relevant when the result is relevant and a 
result being deemed relevant when the result is non-relevant (i.e. "We capture the interesting 
part of the search path in a search session, which is the user's query together with the URLs of the Web 
sites they visit in response to their query... Then, for each group, we identify a number of relevant URLs, 
This is described by the triple (k, uj, Akj) where uj is a URL and A^j is a weight that determines how likely it 
is that Uj is associated with the queries belonging to group k...As mentioned above, sets of such triples 
constitute the parameters in a statistical model for the search sessions, " The preceding text excerpt 
clearly indicates that because the results are collected over many search sessions, and are collected 
from which URLs users visit, if one users visits a non-relevant URL in response to a search, this URL will 
only be determined as relevant in response to one search session. Unless the data from the collected 
search sessions also indicate the non-relevant URL, its relevancy weight will not be greatly affected, 
therefore mitigating/lessening the impact. Also not that, in much the same way, relevant URLs relevancy 
weights will not be greatly affected as a result of being deemed non-relevant in one particular search 
session. The data from many collected search sessions is needed to markedly influence a relevancy 
weight.) (Page 135, Column 2. Paragraph 2; Page 137, Column 2, Paragraph 4; Page 138, Column 1, 
Paragraph 2). 

As per Claim 26, Hansen discloses employing a probability distribution analysis 
or machine learning in connection with the filtering and ranking, wherein suitable 
probability distributions include a Bernoulli, a binomial, a Pascal, a Poisson, an arcsine, 
a beta, a Cauchy, a chi-square with N degrees of freedom, an Eriang, a uniform, an 
exponential, a gamma, a Gaussian-univariate, a Gaussian-bivariate, a Laplace, a log- 
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normal, a rice, a Weibull and a Rayleigh distribution (i.e. 'The same kind of Poisson structure 
usd for the collection of URLs in a search session is applied to the query terms. " The preceding text 
excerpt clearly indicates that a poisson distribution is used.) (Page 11, Column 1, Paragraph 6), and the 

machine learning can classify based on one or more of a word occurrence, a 
distribution, a page layout, an inlink, and an outlink (i.e. "A natural prior for our coefficients Ay 
(the relevance weights) is a Gamma distribution." The preceding text excerpt clearly indicates a 
distribution is used to classify the relevancies.) (Page 141, Column 1, Paragraph 5). 

As per Claim 27, Hansen discloses employing a statistical analysis to rank 
search query results (i.e. "As mentioned above, sets of such triples constitute the parameters in a 
statistical model for the search sessions. These triples can be used by a search engine to improve page 
rankings." The preceding text excerpt clearly indicates that statistical modeling/analysis is used in 
connection with page ranking of the search results.) (Page 138, Column 1, Paragraph 2). 

As per Claim 28, Hansen discloses the ranking comprising one of generating 
word probabilities and employing a confidence interval to determine relevance, and 
generating a similarity measure comprising one of a cosine distance, the Jaccard 
coefficient, an entropy-based measure, a divergence measure and/or a relative 
separation measure to determine similarity (i.e. "Then, for each group, we identify a number of 
relevant URLs. This is described by the triple (k, uj, At^j) where uj is a URL and Ai^ is a weight that 
determines how likely it is that Uj is associated with the queries belonging to group k. " The preceding text 
excerpt clearly indicates that a weight is given to each site to identify how closely it is related to a group k, 
and which is used in ranking. Note that the weights are based on the search terms, as in Figures 3 and 
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4. which therefore indicates that word probabilities are used to assign the weights and ranks,) (Figures 3, 
4; Page 137, Column 2, Paragraph 4). 

As per Claim 29, Hansen discloses a method to manually customize a general- 
purpose search engine for an entry point, comprising (i.e. "Traditional search engine, ..now 

routinely return tens of thousands of resources per query. . .it has typically required the user to report 
details of their search and manually tag pages according to their relevance.., "The preceding text excerpt 
clearly indicates that a general purpose search engine is manually customized to create an entry point 
(e.g. basis to narrow search results, and return more accurate results) by having users tag results for 
relevance.) (Page 135, Column 1, Paragraph 2; Page 137. Column 2, Paragraph 1): providing a set of 

relevant data to train a component to discern query results relevant to a search context 
of a user employing the entry point (i.e. To help guide the cluster process, we also introduce 
labeled data from an existing topic hierarchy that contains over 1.2 million Web sites.,. We let the index j 
range from 1 to the number of URLs seen in the training data, J (which might include URLs from a 
content hierarchy)." The preceding text excerpt clearly indicates a set of data, the portion of which is 
related to the query constituting the set of relevant data, which guides the cluster process/is used to 
discern query results relevant to a search context of a user employing the entry point. Note that URLs 
relevance will be measured by a relevancy score which pertains to how relevant the data is to the groups 
in the training set. A high relevancy score indicates a relevant URL.) (Page 137, Column 2, Paragraph 3); 

providing a set of non-relevant data to train the component to discern query results 
unrelated to the search context (i.e. To help guide the cluster process, we also introduce labeled 
data from an existing topic hierarchy that contains over 1.2 million Web sites,. . We let the index j range 
from 1 to the number of URLs seen in the training data, J (which might include URLs from a content 
hierarchy)." The preceding text excerpt clearly indicates a set of data, the portion of which is unrelated to 
the query constituting the set of non-relevant data, which guides the cluster process/is used to discern 
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query results non-relevant to a search context of a user employing the entry point. Note that URLs 
relevance will be measured by a relevancy score which pertains to how relevant the data is to the groups 
in the training set. A low relevancy score indicates an unrelated URL.) (Page 137, Column 2, Paragraph 

3) , the set of relevant data and the set of non-relevant data are manually provided; and 
determining whether a query result is relevant to the search context based on the set of 
non-relevant data (i.e. "To help guide the cluster process, we also introduce labeled data from an 
existing topic hierarchy that contains over 1.2 million Web sites.,. We let the index j range from 1 to the 
number of URLs seen in the training data, J (which might include URLs from a content hierarchy). " The 
preceding text excerpt clearly indicates that the set of relevant data and the set of non-relevant data are 
manually provided and employed to guide the clustering process/determine whether a query result is 
relevant to a search context.) (Page 137, Column 2, Paragraph 3). 

Hansen fails to disclose the entry point provides a link employed to access the 
general purpose search engine. 

Chang discloses the entry point provides a link employed to access the general 
purpose search engine (i.e. "The implementation of the system is based on the multi-engine search 
architecture. Under this architecture, the system receives a user's query as a list of keywords and 
redirects the query to other search engines (which include Altavista, Infoseek, Lycos, WebCrawler, 
Magellan, etc.). " The preceding text excerpt clearly indicates that a link used to fonward users queries to 
a remote commercial search engine (e.g. an entry point to a search engine) exists.) (Page 2, Paragraph 

4) . 

It would have been obvious to one skilled in the art at the time of Applicants 
invention to modify the teachings of Hansen with the teachings of Chang to include the 
entry point provides a link employed to access the general purpose search engine with 
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the motivation of fonA/arding search queries to search engines and refining the query 
results to direct the searching towards the intended purpose (Page 2, Paragraphs 4-5). 

As per Claim 30 and 35, Hansen discloses the set of relevant data comprising 
data a.ssociated with the search context of the user for the entry point (i.e. To help guide 
the cluster process, we also introduce labeled data from an existing topic hierarchy that contains over 1,2 
million Web sites... We let the index j range from 1 to the number of URLs seen in the training data, J 
(which might include URLs from a content hierarchy)." The preceding text excerpt clearly indicates that a 
set of data, the portion of which that is associated with the search context of the user for the entry point 
constituting the set of relevant data, exists.) (Page 137, Column 2, Paragraph 4; Page 138, Column 1, 
Paragraph 1). 

As per Claim 31 and 36, Hansen discloses the set of non-relevant data 
comprising random data unrelated to the search context of the*user for the entry point 

(i.e. 'To help guide the cluster process, we also introduce labeled data from an existing topic hierarchy 
that contains over 12 million Web sites. . . We let the index j range from 1 to the number of URLs seen in 
the training data, J (which might include URLs from a content hierarchy)." The preceding text excerpt 
clearly indicates that a set of data, the portion of which that is unrelated with the search context of the 
user for the entry point constituting the set of non-relevant data, exists.) (Page 137, Column 2. Paragraph 
4; Page 138, Column 1, Paragraph 1). 

As per Claim 32 and 37, Hansen discloses providing information to associate 
respective query results with the entry point (i.e. "Then, for each group, we identify a number of 
relevant URLs. This is described by the triple (l(, uj, A^j) where uj is a URL and Akj is a weight that 
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determines how likely it is that Uj is associated with the queries belonging to group k...For each such 
group, we select the most relevant URLs.,," The preceding text excerpt clearly indicates that the 
relevancy weight is used to associate query results with the relevant group/the entry point) (Page 138, 
Column 1, Paragraph 2). 

As per Claim 33 and 38, Hansen discloses the set of relevant data and the set of 
non-relevant data employed to train the component to learn the features that 
differentiate relevant data from non-relevant data (i.e. "To help guide the cluster process, we 

also introduce labeled data from an existing topic hierarchy that contains over 1.2 million Web sites,,. We 
let the index j range from 1 to the number of URLs seen in the training data, J (which might include URLs 
from a content hierarchy)." The preceding text excerpt clearly indicates that the set of relevant data and 
the set of non-relevant data are used as a set of training data to identify relevant data.) (Page 137, 
Column 2, Paragraph 4; Page 138, Column 1, Paragraph 1). 

As per Claim 34, Hansen discloses a method to automatically customize a 
general-purpose search engine for an entry point, comprising (i.e. "Traditional search 
engine... now routinely return tens of thousands of resources per query... we propose narrowing search 
results by observing browsing patterns of users during search tasks." The preceding text excerpt clearly 
indicates that a general purpose search engine is automatically customized to create an entry point (e.g. 
basis to narrow search results, and return more accurate results) by observing users during search 
tasks.) (Page 135, Column 1, Paragraph 2): recording a query result selected by a user as 

relevant (i.e. "We capture the interesting part of the search path in a search session, which is a user's 
query together with the URLs of the Web pages they visit in response to their query... by combining 
search sessions with queries in a given group, we can better identify relevant URLs." The preceding text 
excerpt clearly indicates that URLs/query results selected by a user are used as data to identify relevant 
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URLs, therefore some of them will be marked as relevant.) (Page 135, Column 2, Paragraph 2; Page 137, 
Column 1, Paragraph 3); recording a higher ranked query results, wherein a lower ranked 

result is selected by the user, as non-relevant {i.e. "„.PageRank is based on the amount of time 

a "random surfer" would spend on each page. " The preceding text excerpt clearly indicates that if the 
browser spends time on a given web page, it increases the relevancy of that web page (e.g. marks it as 
relevant) and decreases the relevancy of other pages, which may be considered more relevant (e.g. 
marks a relevant page as non-relevant).) (Page 137, Column 2, Paragraph 2); and providing the 

recorded results to autonnatically train the filter to discriminate between results relevant 
to a search context of the user for the entry point and results non-relevant to the search 
context (i.e. "We capture the interesting part of the search path in a search session, which is a user's 
query together with the URLs of the Web pages they visit in response to their query... Implicit in our 
approach is a form of query clustering that combines similar search terms on the basis of Web pages 
visited during a search session. These clusters are then used to improve the display of search engine 
results." The preceding text excerpt clearly indicates that previous query results and selections are 
recorded and added to the set of data used to discriminate between relevant results and non-relevant 
results.) (Page 135, Column 2, Paragraph 2). 

Hansen fails to disclose executing a query search via the entry point that 
includes a link employed to route to the general purpose search engine. 

Chang discloses executing a query search via the entry point that includes a link 
employed to route to the general purpose search engine (i.e. "The implementation of the 
system is based on the multi-engine search architecture. Under this architecture, the system receives a 
user's query as a list of keywords and redirects the query to other search engines (which include 
AltaVista, Infoseek, Lycos, WebCrawler, Magellan, etc.). " The preceding text excerpt clearly indicates 
that a link used to fonA/ard users queries to a remote commercial search engine (e.g. an entry point to a 
search engine) exists.) (Page 2. Paragraph 4). 
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It would have been obvious to one skilled in the art at the time of Applicants 
invention to modify the teachings of Hansen with the teachings of Chang to include 
executing a query search via the entry point that includes a link employed to route to the 
general purpose search engine with the motivation of forwarding search queries to 
search engines and refining the query results to direct the searching towards the 
intended purpose (Page 2, Paragraphs 4-5). 

As per Claim 39, Hansen discloses the query results selected via a click thru 
technique employing a mouse to select a link associated with the query result by 
clicking on the link (i.e. "...users click through data to discover disjoint sets of similar URLs"The 
preceding text excerpt clearly indicates results and URLs are viewed using a click-through technique 
incorporating a mouse.) (Page 141. Column 1, Paragraph 2). 

As per Claim 40, Hansen discloses generating a word probability distribution for 
the relevant recorded results and a word probability distribution for the non-relevant 
recorded results (i.e. To help guide the cluster process, we also introduce labeled data from an 

existing topic hierarchy that contains over 1.2 million Web sites... We let the index j range from 1 to the 
number of URLs seen in the training data, J (which might include URLs from a content hierarchy) . . . Then, 
for each group, we identify a number of relevant URLs. This is described by the triple (k, Up Ai,j) where uj 
is a URL and A^j is a weight that determines how likely it is that Uj is associated with the queries belonging 
to group k."7he preceding text excerpt clearly indicates that a weight is given to each site to identify how 
closely it is related to a group k, and which is used in ranking. Note that the weights are based on the 
search terms, as in Figures 3 and 4, which therefore indicates that word probabilities are used to assign 
the weights and ranks. Note that the weights are generated for each group and that the data set is 
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comprised of a portion that is associated with the search context of the user for the entry point and 
constitutes the set of relevant data, and a portion that is irrelevant to the search context of the user for the 
entry point and constitutes the set of non-relevant data.) (Figures 3, 4; Page 137, Column 2, Paragraph 
4). 

As per Clainn 42, Hansen discloses a computer readable medium storing 
computer executable components that tunes a general-purpose search engine to 
improve context search query results, comprising (i.e. "Traditional search engines like Lycos 

and Google now routinely return tens of thousands of resources per query. .,we propose narrowing 
search results by observing the browsing patterns of users during search tasks. " The preceding text 
excerpt clearly indicates a system that narrows the search results of/tunes a traditional search engines/a 
general purpose search engine (e.g. Lycos. Google).) (Page 135, Column 1, Paragraph 2): a 

component that filters the general-purpose search engine results based on training data 
sets associated with the search context of a user (i.e. To help guide the cluster process, we 
also introduce labeled data from an existing topic hierarchy that contains over 1.2 million Web sites. ..We 
let the index j range from 1 to the number of URLs seen in the training data, J (which might include URLs 
from a content hierarchy)... When a new user initiates a search, we present them with a display of query 
groups most related to their search terms. For each such group, we select the most relevant URLs 
arranged in a display like that in Figure 4. " The preceding text excerpt clearly indicates training data 
associated with the search context of a user is used to filter general purpose search engine results.) 
(Figure 4; Page 137, Column 2, Paragraph 4; Page 138, Column 1, Paragraphs 1-2); and a 

component that ranks the general-purpose search engine results according to the 
similarity of the search engine results to the training data sets (i.e. "To help guide the cluster 
process, we also introduce labeled data from an existing topic hierarchy that contains over 1.2 million 
Web sites.. . We let the index j range from 1 to the number of URLs seen in the training data, J (which 
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might include URLs from a content hierarchy)... These triples can be used by a search engine to improve 
page rankings." The preceding text excerpt clearly indicates that a ranking component is present which 
ranks data based on/in accordance with similarity to the content hierarchy/training data.) (Page 137, 
Column 2, Paragraph 4; Page 138, Column 1, Paragraphs 1-2). 

Hansen fails to disclose an entry point that provides a link utilized to arrive at the 

general-purpose search engine. 

Chang discloses an entry point that provides a link utilized to arrive at the 
general-purpose search engine (i.e. "The implementation of the system is based on the multi- 
engine search architecture. Under this architecture, the system receives a user's query as a list of 
keywords and redirects the query to other search engines (which include Altavista, Infoseek, Lycos, 
WebCrawler, Magellan, etc.). " The preceding text excerpt clearly indicates that a link used to forward 
users queries to a remote commercial search engine (e.g. an entry point to a search engine) exists.) 
(Page 2, Paragraph 4). 

It would have been obvious to one skilled in the art at the time of Applicants 

invention to modify the teachings of Hansen with the teachings of Chang to include an 
entry point that provides a link utilized to arrive at the general-purpose search engine 
with the motivation of forwarding search queries to search engines and refining the 
query results to direct the searching towards the intended purpose (Page 2, Paragraphs 4- 
5). 

As per Claim 43, Hansen discloses a system that filters and ranks general- 
purpose search engine results, comprising (i.e. "These triples can be used by a search engine to 

improve page rankings. When a new user initiates a search, we present them with a display of query 
groups most related to their search terms. For each such group, we select the most relevant URLs 
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arranged m a display like that in Figure 4. " The preceding text excerpt clearly indicates a method for 
filtering and ranking of general-purpose search engine results associated with an entry point. Note that 
the entry point is associated with the groups mentioned in the quotation and as referenced above.) (Page 
138, Column 1. Paragraph 2): means for filtering general-purpose search engine results to 

determine whether a query result is relevant to a search context of a group of users (i.e. 
",..we consider improving search results by first forming groups of queries based on the similarity of their 
associated search sessions... This has the effect of reducing spurious associations between queries." The 
preceding text excerpt clearly indicates that search results from a search engine are improved/filtered 
based on search context associated with a group of users.) (Page 137, Column 1, Paragraphs 3-4), and 

means for ranking the general-purpose search engine results based on a relevance of 
the general-purpose search engine results to the search context of the group of users 

(i.e. "These triples can be used by a search engine to improve page rankings. When a new user initiates 
a search, we present them with a display of query groups most related to their search terms. For each 
such group, we select the most relevant URLs arranged in a display like that in Figure 4. " The preceding 
text excerpt clearly indicates that the general-purpose search engine results are ranked based on 
relevance to the search terms/context of the users.) (Page 138, Column 1, Paragraph 2). 

Hansen fails to disclose an entry point that includes a link employed ot navigate 
to the general purpose search engine and means for ranking the general-purpose 
search engine results based on a relevance of the general-purpose search engine 
results to the search context of the entry point. 

Chang discloses an entry point that includes a link employed to navigate to the 
general purpose search engine (i.e. "The implementation of the system is based on the multi- 
engine search architecture. Under this architecture, the system receives a user's query as a list of 
keywords and redirects the query to other search engines (which include Altavista, Infoseek, Lycos, 
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WebCrawler, Magellan, etc.). " The preceding text excerpt clearly indicates that a link used to forward 
users queries to a remote commercial search engine (e.g. an entry point to a search engine) exists.) 
(Page 2. Paragraph 4) and means for ranking the general-purpose search engine results 

based on a relevance of the general-purpose search engine results to the search 

context of the entry point (i.e. "The feedback of the selected topics also helps to customize the 

knowledge base, since there is a percentage of negligible words, like people names or mistyped words 
that are not common to other queries and are weighed high in the weighting. But with user's feedback, 
we can identify what are the terms to be stressed and what are the ones that are irrelevant to the 
query... feedback of the same keywords can be repeated to strengthen their significance. " The preceding 
text excerpt clearly indicates that the query results are filtered based on information from the knowledge 
base, which is associated with the entry point, and are used to further refine the knowledge base.) (Page 
6, Paragraph 2). 

It would have been obvious to one skilled in the art at the time of Applicants 
invention to modify the teachings of Hansen with the teachings of Chang to include an 
entry point that includes a link employed to navigate to the general purpose search 
engine and means for ranking the general-purpose search engine results based on a 
relevance of the general-purpose search engine results to the search context of the 
entry point with the motivation of fonA/arding search queries to search engines and 
refining the query results to direct the searching towards the intended purpose (Page 2, 
Paragraphs 4-5). 



2. Applicant's amendment necessitated the new ground(s) of rejection presented in 
this Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP 
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§ 706.07(a). Applicant is reminded of the extension of time policy as set forth in 37 
CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1 .136(a) will be calculated from the mailing date of 
the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the date of this final action. 

Points of Contact 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Michael J. Hicks whose telephone number is (571) 272- 
2670. The examiner can normally be reached on Monday - Friday 8:30a - 5:00p. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Jeffrey Gaffin can be reached on (571) 272-4146. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 
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